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SUMMARY 


Under a new type of vector multiplication, a group 
(whose elements are vectors) is defined which is isomor- 
phic to the group of rotations. This allows a vector repre- 
sentation of rotations which has many advantages over the 
usual matrix or Eulerian angles approach— e.g., this vector 
representation avoids the need for trigonometric relation- 
ships and requires only three independent parameters. 

The simplicity of this vector representation is demon- 
strated by its use in several applications; in particular, an 
analytic solution to a least-squares rotation problem is 
presented. The differential equations defining the motion 
of a rigid body are also obtained in terms of a vector differ- 
ential equation. 
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1. INTRODUCTION 


The usual approach to the algebra of rotations is by means of matrix algebra, 
where the matrix of the rotation is defined by the product of simple rotation mat- 
trices. Each of the simple rotations is about a coordinate axis and is uniquely 
determined by an angle. The most general rotation is uniquely defined by three 
angles (Eulerian angles) provided that the axis of each rotation is known. Since 
the convention of defining Eulerian angles varies considerably in the literature, 
the product matrix is ambiguous when only three angles are known. The angular 
or matrix approach to rotations is complicated further in that it requires the 
evaluation of trigonometric functions, which necessitates the use of tables or a 
computer. 

The approach here is to define a group, whose elements are three-dimensional 
vectors, which is isomorphic to the group of rotation matrices. The algebra 
of rotations can then be represented by the algebra of vectors (dot and cross 
product plus a new vector product which will be defined later), and does not re- 
quire the evaluation of trigonometric functions. 

A comment on notation: Matrices and vectors will be denoted by capital 
English letters and their elements by small English letters with subscripts. 

All vectors are considered to be column vectors, and a superscript T is used 
to denote the tranpose of a vector or matrix. A superscript of -1 represents 
the inverse matrix. Primes will be used to denote the transform of a vector. 
Vector notation, including that for the dot and cross product, is used in this 
paper merely to simplify the algebraic relationships existing among the .com- 
ponents of various vectors, and is not intended to provoke physical interpretation. 
For example, even though two vectors may represent the same physical quantity, 
they are not regarded as equal unless they are expressed in the same coordinate 
system. Similarly, the notation U - Vx W may be used (even though V and W are 
expressed in different coordinate systems) merely to indicate that the components 
of U are formed from the components of V and W according to the standard equa- 
tions for cross products. It will be assumed throughout that all coordinate sys- 
tems are right-handed orthonormal systems. 


2. VECTORS DEFINING ROTATIONS AND THEIR ALGEBRA 

If R is the matrix of a rotation then [1] and [2] show that R may be expressed 
by 


'l o o\ / -i 

R x (<9) = cos <9 [ 0 1 0 | + (l-cos6>)[ x x x 2 

vO 0 1 


X 1 X 2 X 1 X 3 


3 \ 


X 2 X 2 X 3 


i + sin 6 -x 


X 1 X 3 X 2 X 3 X 3 


/ 
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where X = ( x r x 2 , x 3 ) T is a unit vector defining the axis of rotation and 0 is 
the angle of rotation. Unless otherwise stated it is assumed here that X has been 
selected so that 0 < 6 <tt. This can always be done since R x (< 9 ) - R _ x (-0) and 
R x (n + #) = R_ x (rr-0). 

The following expression for R may also be found in [1] : 


where 


R = (B T )~ 1 B , 


x 3 tan 2 


0' 

-x 2 tan ~2 


B = 


-x tan 2 


V 


2 tan 2 


e 

-Xj tan ~2 


Xj tan ~2 


( 2 ) 


(3) 


In order to eliminate the angle in equations (1) and (3) we assign a length 
(which is a function of 8) toX, in such a way that the sine and cosine of 8 can be 
determined by this length. Many such functions exist, and we choose tan <9/2 and 
sin 6/2 as examples. Thus, let 


6 0 

Y - tan ~2 X , Z - sin Tj X , where 0 < 8 < tt ; 
by standard trigonometric identities we obtain: 
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sin 


Vy 7 


y\ + y 2 




e 


COS 


VT+y 7 


= , 


sin0 = TTpr = 2 »' ZJ ( 1 - ZJ ) ■ 


cos 6 = 


1-Y 2 

1 +y 2 


= 1 - 2 Z 2 


2 


Hence the definition of R as given by equation (1), using Y and Z respectively, 
becomes 


R 


1+Y 2 


'1 0 o\ / y x 2 yiy 2 yiy 3 ^ 

(l-Y 2 )( 0 1 0 j+2 yjy 2 y 2 2 y 2 y 3 

\0 o \J \yjy 3 y 2 y 3 y 3 2 / 




( 4 ) 


and 


Z 1 Z 2 z i z 3 ' 


R = 


'10 0 

(l — 2Z 2 )[ 0 10 ] + 2 j z t z 2 z 2 2 z 2 z 


\0 0 1 , 


‘ Z 1 Z 3 Z 2 Z 3 



( 5 ) 


As a function of Y, equation (3) becomes 



( 6 ) 


Thus every vector Y defines a rotation matrix by equation (4), and every vector 
Zwith Z 2 < 1 defines a rotation matrix by equation (5). Hence equations (4) and 
(5) define mappings which map sets of vectors into the set of rotations. We have 
yet to verify that the mapping is onto, i.e., that for every rotation matrix there 
exist vectors Y and Z such that equations (4) and (5) define the matrix of the 
rotation. 

Let R be the matrix (with elements r.p of a rotation. Separating R into sym- 
metric and skew- symmetric parts, we have 


R 



+ 


J (R-R T ) 


* 


3 



comparing this with equation (5), we get the following relations: 


3 - 4Z 2 = o- , 


(7) 


where cr is the trace of R ; 


2 Vl - Z 2 Z 



( 8 ) 


2Z 2 Z 3 = k ( r 3 2 + r 2 3 ) ’ 

2Z 1 Z 3 = T ( r 13 +r 3l) ’ 

2Z 1 Z 2 = J ( F 21 + r i2) > 
1 - 2( z 2 2 + z 3 2 ) = r n , 

1 " 2 ( Z 1 2 + Z 3 2 ) = r 22 > 

1 - 2( z 1 2 + z 2 2 ) = r 3 3 , 
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These last three equations imply 


2 = 


'N 

(l + r n ~ v 22 ~ r 33)/^ 1 


2 2 = ( 1 + 


r 2 2 r il f 3 3 


)/ 4 ^ 


(10) 


2 = (1 + 


(* + r 33 r ll r 2 2 )/^ ‘ 


J 


Thus to express R in the form of equation (5), Z must satisfy nine conditions; 
however, by using the well-known properties of rotation matrices, it is easy to 
verify that these nine conditions are consistent. 

Equations (7) and (8) give a convenient expression forZ, namely, 



provided Z 2 / 1. If Z 2 = 1, then equations (9) and (10) may be used to determine 
Z. In this case, however, Z is not unique, since -Z also satisfies equations (9) 
and (10) if Z does. In practice, this ambiguity is of no consequence, since if 
Z 2 = 1, Z and -Z yield the same matrix. 

From equation (1), we find that cr - 1 + 2 cos 8 , which by equation (11) implies 
that Z 2 ^l. 

Hence the mapping defined by equation (5) is a mapping of three-dimensional 
vectors over the field of real numbers whose length is less than or equal to unity 
(denote this set of vectors by i) onto the group of rotation matrices. The mapping 
is also one-to-one except when Z 2 = 1 ( 0 - = - 1). 
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The vector Y may be obtained either directly from the matrix R (in a manner 
analogous to the preceding), or from the vector Z by using the relationship 


Y = 


1 

yi - z 2 


z . 


( 12 ) 


Thus, 


Y 


1 

l+o- 



(13) 


The vector Y is undefined when a - - 1 . This singularity may be removed if we 
agree to allow vectors of infinite magnitude whose direction Is given by a unit 
vector, X say. For such vectors, equation (4) becomes 


R = - I + 2XX t . 


(14) 


With this convention, Y may be obtained from the rotation matrix R with trace -1 
merely by observing that X = Z in this special case. Hence, X may be obtained 
from equations (9) and (10). 

Let 7 ] denote the set of all real three-dimensional vectors augmented by the 
vectors of infinite magnitude discussed above. Then equations (4) and (14) define 
a mapping from 77 onto the group of rotation matrices. 

Thus, either of the two sets £, or 77 may be used to parametrize the group of 
rotations. However, when vectors are used for this purpose it should be emphasized 
that they correspond to transformations. Hence, it is their algebraic properties as 
transformations that we are primarily concerned with, and not their properties as 
vectors (indeed £ is not even a vector space). It is of no consequence that 
two elements of £ or 77 may be equal in the sense of equality of transforma- 
tions but not equal in the normal sense of vector equality. On the other 
hand, the abundance of vector operations and their algebraic properties that 
are normally employed to simplify relationships between components of vec- 
tors can also be applied o vectors corresponding to a rotation. Indeed, it 
is the fact that the algebi i of vectors as rotations can be expressed in terms 
of standard vector notation that makes the vector parametrization appealing. In 
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order to distinguish vectors used to parametrize rotations from ordinary vectors 
which are transformed by rotations, we introduce a new type of vector which is 
obtained by defining a new equivalence relation among real three-dimensional 
vectors. Definition: Given a set 8 of .real three-dimensional vectors and a map- 
ping t which maps 8 onto the group of rotation matrices then two elements of 8 
are said to be equivalent if they map into the same rotation matrix. 

For the two sets £ and 97 defined above, vector equivalence is the same as 
vector equality except for vectors mapping into rotation matrices with trace of -1. 

Clearly, vector equivalence as defined above is an equivalence relation and 
separates the sets £ and 97 into disjoint classes. 

These equivalence classes are merely the inverse images under r of the 
rotation matrices. We shall call these equivalence classes "rotation vectors" 
and denote them in the same manner as ordinary vectors. 

The usual algebra associated with vectors is also applicable to rotation 
vectors. The well-known operations of scalar multiplication, vector addition, 
dot and cross products are performed on rotation vectors in the classical man- 
ner and the ordinary symbolism is used to denote these operations. When there 
is more than one vector in the equivalence class, the result of an operation is 
that obtained by using either vector. However, the same vector must be used 
throughout any one expression. 

Given two rotation vectors Yj and Y 2 , equation (4) defines two rotation ma- 
trices, say Rj and R 2 respectively. It is well-known thatR = R 2 R 2 is also the 
matrix of a rotation. Thus by a previous discussion there exists a rotation vector 
Y which defines R. This rotation vector Y can be obtained from equation (13) by 
forming the product R 2 in terms of Y x andY 2 , which gives the remarkably 
simple expression 


Y 



(Yi + Y 2 + Y t x Y 2 ). 


(15) 


If Zj and Z 2 are members of £ defining R t and r 2 respectively then the Ze£ 
defining R, such that R = R 2 R : , may be obtained in a similar manner i.e. by 
comparing the trace and skew-symmetric part of R 2 R x with the trace (a) and 
skew- symmetric part of R respectively as given by equation (5). This gives 

a = 3 - 4Z 2 = 3 - 4Z 0 2 , 
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Vi - z 2 z = 


(yrz? t/t z*-z t -z 2 )z a 


where 


' Z„ = ^1 -Z 2 2 Z.+ yi-Z.JZj + Zj x z 2 . 


Thus 


1 - Z 2 = 1 - z 0 2 = 



and 


r^z? yi-z 4 J - z, z 2 

— — 1 > 

Vi -z 2 

where the ‘sign depends only on the numerator since Vl - Z 2 > 0 by definition i.e. 

Vl - z 2 = cos (9/2 where 0 <6<tt . Hence, using the signum function, sgn (x), which 
is defined to be +1, -1 according as x is positive or negative, respectively (here 
zero is considered positive or negative) 


Z 


sgp ^Fz 7 yr Z*-Z X 'Z 


I 


(16) 


The above argument does not guarantee that the Z as given by equation (16) cpr- 
responds to R = R 2 Rj when Z 2 = 1. However, it is straightforward, but tedious, 
to verify that equation (16) indeed gives the Z defining R = R 2 R x for all and Z 2 
contained in £ corresponding to R t and R 2 respectively. 

Actually equation (16) may be obtained more readily from equations (12) and 
(15), and the identity 

Z = — — Y (17) 

Vl + Y i 
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which is a useful formula since it is also valid in the limit as Y 2 approaches 
infinity. 

Equations (15) and (16) suggest a new type of vector product. Definition: 

Let S be a set of vectors, let * be a binary operation on S and let t be a mapping 
of S onto the group of rotation matrices. Then * is said to be a rotation product 
if it is preserved by r, i.e. if r(V* W) = r(V)^( W), for every V and W in S. 

Thus for Zj, Z 2 in £, the productZ = Z 2 *Z 1 given by equation (16) is a 
rotation product. In a similar vein, for Y r Y 2 in 77, equation (15) defines a ro- 
tation product, except when Yj • Y 2 = 1, or when either of the rotation vectors has 
infinite magnitude. In these exceptional cases, we define the product Y 2 *Yj by 
forming Zj and Z 2 by equation (17), Z 2 *Zjby equation (16) and then Y 2 * Y^by 
equation (12). 

Let 77' and £' denote the two sets of rotation vectors (equivalence classes) 
defined by vector equivalence and the sets 77 and £ respectively. Then each of 
the two sets 77' and £', together with its rotation product, forms a group. 

Clearly 77' is clor ' under the rotation product. Also, since equation (17) 
implies that Z~ Z 2 * Z l has length less or equal to unity, £' is closed. The associ- 
ativity of the rotation product is easily verified. The identity of each set is the 
null rotation vector, and the inverse of V (in either set) is -V. Furthermore, the 
respective mappings of the two sets defined herein are isomorphisms onto the 
group of rotation matrices. Thus I (the identity rotation matrix) is the image of 
the null rotation vector and R~ 1 is the image of -V if R is the image of V . 

Actually, a one-to-one relationship between vectors and rotations may be 
defined without the concept of equivalence class. For example, when a = - 1 ad- 
ditional conditions on Z may be imposed so as to insure uniqueness. The choice 
of which additional constraints to select, however, depends on the preference of 
the user (a situation analogous to the many definitions of Eulerian angles). Thus, 
we introduced vector equivalence to emphasize that for the purpose of representing 
a rotation it does not matter which vector one selects out of an equivalence class 
for a 180 degree rotation. 

The group of rotation transformations can now be represented by either of 
the two groups £' or 77'. This representation has certain advantages over the 
usual matrix representation, stemming from the fact that the rotation is defined 
by three independent parameters, without recourse to trigonometry (the matrix 
approach requires nine elements, or the evaluation of six trigonometric functions 
plus two matrix multiplications). In many applications the vector representation 
requires fewer calculations than the matrix representation. For example, matrix 
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multiplication requires 27 scalar multiplications plus 18 additions; the rotation 
product of equation (15) requires only 13 multiplications and 10 additions. 

The choice of which vector representation to use depends upon the applica- 
tion. The formulas associated with the Y rotation vector are in general quite 
simple, requiring no radicals or trigonometric functions. Thus this vector repre- 
sentation is a valuable tool for hand calculations or for deriving theoretical 
results, but has the inconvenience of becoming infinite for all 180-degree rotations. 
The Z vector representation, on the other hand, is always finite, always defined 
(uniquely, except for its sign at 180-degree rotations), and the rotation product 
is valid for all rotations. However, the existence of the radical is inconvenient 
for hand calculations. The vector representation W = tan <9/4 X may prove to be 
useful since it combines some of the assets of both the Y and Z. In this case we 
have 


R = 


(l + W 2 ) 


(W 3 4 - 6W 2 + l) I + 8WW T + 4 (l - W 2 ) ( -w 


Y = 


1 -w 2 


W , 


1 + W 2 


r 

*3 

h 

0 

\ W 2 

_W 1 

r w . 



-w„ 


w. 


W 


Y = 


1 + Vl + Y 2 1 + Vi - Z 2 


3. COORDINATES OF A ROTATED VECTOR 

One of the most frequent uses of a rotation matrix is to find the coordinates 
of a vector after the rotation. The vector representation yields a useful formula 
for this application. Let V' be the image of V under the rotation and R the matrix 
of the rotation (i.e., V' = RV). If Y and Z are the rotation vectors defining R, then 
equations (4), (5), and (14) give V' as 


r 1 


YV 


- < 


1 + Y 2 


[(l -Y 2 ) V+ 2(V • Y)Y + 2Vx Y , Y 2 < 


V+ 2(V • X)X , Y 2 - 00 

V' = ZV = (l - 2Z 2 ) V + 2(V • Z)Z + 2 V x Z 
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Note that we have used the symbolism YV to denote the image of V under the 
rotation corresponding to Y. Thus the above equations define a vector multipli- 
cation in the same sense that RV is used to denote a matrix multiplication. 

4. ROTATIONS DETERMINED BY A VECTOR AND ITS IMAGE 

In many instances, one is given two vectors (V and V') and desires to deter- 
mine the matrix R such that V' = RV. If R is to be a rotation matrix, then we 
must have V' 2 - V 2 . For simplicity, we assume that V' 2 = V 2 = 1. The rotation 
taking V into V' is not unique; however, the vector representation of the "shortest 
path" rotation is immediately obvious— the axis of rotation is collinear with V'x V 
and the angle of rotation is cos -1 (V*V'). Thus 


Y 


x ,, i 

1 + V • V' V x 


V, 


V • V ' t - 1 


z 


1 

V2(l + V • V') 


V'xV , 




If V • V' = - 1 then Z is any vector satisfying the two conditions Z 2 = 1 and Z • V - 0; 
Y has infinite magnitude with direction defined by Z . 

5. ROTATIONS DETERMINED BY TWO VECTORS AND THEIR IMAGES 

A common practice in orbit theory is to construct a rotation that takes the 
xy -plane into the plane of the orbit and also takes the x-axis into the direction 
of perigee. This is a particular example of the following problem: Given v x , V 2 
(Vj andv 2 noncollinear), V/ , andV 2 ' such that Vj' 2 = V 2 , V 2 2 = V 2 2 , and 
• V 2 = Vj* V 2 , find the rotation that takes V y into V x ' and V 2 into V 2 . Here 
again, the vector representation of the rotation yields a simple solution. 

If V' = RV, then equation (2) implies thatB T V' = BV. By equation (6) and 
matrix multiplication, this condition can be written as 


V'+YxV' * V-YxV, 
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or 


V - V' = Y x(V + V' ) 

where Y is the vector representation ofR. 

Thus, the conditions V/ = RVj and V 2 ' = RV 2 may be expressed by 

Vj - Vj' = Y x(Vj + Vj') and V 2 - V 2 ' = Yx(V 2 +V 2 ') - 

Let 

Aj = V A - V/ , B 4 = V 4 + V/ (i = 1, 2) 

then the condition equations become 

k i = YxBj , (i = 1, 2) 

where A • Bj =0 and k t • B 2 =- A 2 • B x . It is immediately obvious that Y • A i = 0. If 
each side of the first equation is cross multiplied by the vector A 2 , we obtain 

A 2 x Aj = A 2 x (Y* Bj) = (A 2 • Bj) Y - (A 2 • Y) B x = (A 2 • B x ) Y ; 

thus if A 2 • Bj = - Aj • B 2 / 0 , we obtain a simple expression for Y , namely, 

v = V, V 2 '-V 2 -V,- (V.-V,') x(V 2 -V 2 ') . (18) 
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A more general expression may be obtained as follows. Cross multiply each 
side of the i* equation by B . ; this gives 


B a xAj - B/Y- (Bj • Y) B. • (i = 1 , 2) 

Therefore, 

B 2 -(B,xA 1 ) = Bj 2 B 2 ■ V -(B, • B^Bj ■ Y . 

B, ' (B 2 <A 2 ) = B *8, ' V - (B, • Bj B, • Y , 


and solving these two linear equations for Bj • Y andB 2 * Y yields 


(B^B^ ' [(B^ A 2 ) B t • B 2 A^ 

(BiXB 2 ) 2 

(B,xB 2 )- fe,-B 2 )A 2 -B 2 2 A,] 
(B^B,) 2 


Substitution of these last expressions into equation (19) gives the solution 
for Y, provided BjXB 2 / 0. 

If v ' = V x and/or V 2 ' = V 2 , the Z rotation vector is obtained from Y by 
equations (17) and (19) unless B^Bj = 0; otherwise, equations (17) and (18) pro- 
vide a much simpler expression forZ, namely, 


Z = 


sgn (V, * V 2 ' - V 2 • V 


V( V . • V 2 " V 2 • V lf + 1( V , -V,') X (V, -V 2 ') 


(V 1-v,') x(V 2 -V,') 


WhenV/ = V i and B 1 xB 2 - 0, Z - V.. 
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The above expressions provide another means of determining the vector 
representation of a rotation from the matrix of the rotation R. Merely choose 
two independent vectors (V x and V 2 ) and set Vj' = RVj , V 2 = RV 2 . 

Given V r V 2 , V; and V 2 ' such that Vj 2 = V{ 2 , V 2 2 = V 2 * 2 and Vj • V 2 

= V 1 ''V 2 , the matrix of the rotation, R, which takes Vj into Vj' and V 2 into V 2 can 
be determined from the vector representation obtained above and equation (4) or 
(5). A more direct approach, however, is to construct a right-handed orthonormal 
coordinate system from the vectors V x and V 2 and a second system from the 
vectors Vj' and V 2 ' in the same manner. The rotation taking the first system 
into the second will then take Vj into V/. The matrix of this rotation can be 
easily written as the product of two matrices, each of whose rows or columns 
are formed from the components of the constructed axes relative to some under- 
lying coordinate system. Let 


then 


u, = v,/|v, 


u,' = Vj'/lv/l 


w 




(V • vj) 

Vj' 2 



U 2 


R ! 


w/|w| , u 2 ' 

Ujxu 2 , u; 

(u.,u 2 ,u 3 ) , R, 


r/|r| , 

u, x u, , 

(U,',U 2 ',U 3 ') ; 


R = R 2 Rj” 1 . 
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6. A LEAST SQUARES ROTATION - DETERMINATION OF ATTITUDE 1 

Consider a vehicle with a local coordinate system which has been rotated 
from a fixed coordinate system. If the vehicle has equipment capable of deter- 
mining the direction (relative to the local coordinate system) of a point whose 
direction relative to the fixed coordinate system is known, then the rotation re- 
lating the local and fixed systems can be determined by two or more observations 
of such points. 2 If the directions were exact, then the rotation relating the two 
coordinate systems could be obtained from any two noncollinear observations 
by the methods of the previous section. In practice, however, exactness is not 
obtainable and in such cases one generally seeks the "least squares" solution. 

For the case at hand, we seek a rotation matrix R such that the scalar function 


0(R) 


n 


Z>-0‘ 


i=l 


is a minimum. Here V i and W 4 are vectors defining the directions of a point 
relative to the fixed and local coordinate systems, respectively. The matrix 
R— thus also the function <p(R)— has only three independent parameters. Any ro- 
tation which makes <p{ R) a minimum is either a solution of the three equations 
obtained by setting the partial derivatives of $ with respect to each independent 
parameter to zero or at a point where the partials do not exist. The use of 
Eulerian angles as the independent parameters leads to three condition equations 
which are quite complicated, tedious to derive, and must be dealt with as three 
scalar equations. The condition equations in terms of a vector representation 
of R, on the other hand, can be expressed as a single vector equation which is 
easily derived and has a solution obtainable by vector and matrix algebra. 

Since R is a rotation matrix, 0(R) may be written as 

w = 22 

i- 1 


V/ + W A 2 - 2Wj • (RVi) 


^This section is a solution to Problem 65' 1 by Grace Wahba in SIAM Review Vol. 7. No. 3, July 106^, 
which appeared during the writing of this paper. 

^The Orbiting Astronomical Observatory has this capability, where the points are known stars. 
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which as a function of Ye 77 gives 


0(R) 


= J| V i 2+W i 2 -YT^ (l-Y 2 )v i *W i+ 2(V i -Y)(W i -Y) + 2(W i xV i ) * Y 


(20a) 


when Y 2 <“>, and 


0(r> = £]{ v i 2+w i 2-2 _v i ■ w i + 2 (v i * x )c» i - x ) 

i= 1 


(20b) 


where X 2 = 1, when Y 2 - °°. 

In the first case, we have 




4 V 1 


(l +Y 2 ) 2 L-1 


i = 1 


(V,xW 1 )-Y-(V 1 -Y)(W 1 -Y)-V 1 -W 1 


(l+y 1 ) [(V, -Y).,, * (w i -Y)*!,, -(V.xWjh • 


L 

where the subscript j refers to the j th component of the vector. Thus a neces- 
sary condition for <p to have a minimum (at least among the rotations of less than 
180°) at some Y is given by the vector equation 


-Y + (V, -Y)^ -Y) +V; -W,]y = (l+Y 2 )£ [(V, -YjW, ♦ (W, -YjV, 


( 21 ) 


(For convenience, the range of summation is omitted hereinafter; always the 
letter i will be used for the summation index.) 

Taking the dot product of each side with Y, we obtain 


£> * Y )(Wi -Y) 



Y + 2V ■ • W 




v .) 


16 


\ 

When this last expression is substituted in the vector equation and the factor 
1 + Y 2 is divided out, we obtain 


a 


w. 




Y + 2V ; • W 


i] Y “ [( V i ' Y ) W i + ( W i ' Y ) V i +W i X V i]} = 0 •• : (22) 


The same substitution into equation (20a) gives 


^ (R) = ■ 2y 'ZI Wi x Vi (23) 

when Y satisfies equation (22). Hence, to minimize <p we must take the solution 
of equation (22) which makes Y • SWj xV ; a maximum. 

Equation (22) may be written in matrix notation as follows: 


(A T YI+B)Y = A 


(24) 


where I is the identity matrix, A is the vector XW i xV ; and B is a symmetric 
matrix, with elements 


b ik - "a, + v ik w ij) i r k . 

b n = • », -v.i »«) • 


(To see this most easily, write out equations (22) and (24) in component form.) 

If A - 0, thenY = 0 is the desired solution (there may be other solutions if 
det B = 0, but the value of cp will be the same for all solutions). If A / 0, then 
the solutions may be obtained as outlined below. 
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Multiplying each side of the matrix equation by the adjoint of A T YI + B, we 
obtain 

[det (A T YI + B)J Y = [adj (A T YI + b)J A , 

and a multiplication of this equation by A T yields the scalar equation 

jdet (A T YI + B)j A T Y = A T [adj (A T YI + B)] A . 

Denoting the scalar A T Yby A. , the scalar functions det (X.I + B) and A T [ adj (AI + B )] A 
as f (A.) and g(\), respectively, the above scalar equation may be written as 

\f(\) - g(\) = 0 . 


Note that the left-hand side, say h(/v), is a fourth-degree polynomial in A- , and 
that -f (-A) is just the characteristic polynomial of the symmetric matrix B. 

The solutions to equation (24) are obtained by determining the zeros of h(A) 
and solving the resulting linear equations. However, in the discussion following 
equation (23), it was determined that the maximum value of Y * 2(W i xVj , i.e., 
the largest zero of h(X) (denote this zero by A. 0 ), leads to the minimum value of 

0(Y). 

Since h(/V) is a fourth-degree polynomial, A 0 may be obtained analytically; 
however, a numerical iterative solution is probably more practical, since the 
zeros of h(A.) are easily bounded. We note, first of all, that</>(Y)is a non-negative 
function; therefore, equation (23) implies 


which provides an upper bound. 

Since B is symmetric, there exists an orthogonal matrix, P say, such that 
P 1 BP is diagonal. LetY' and U be vectors such that Y = PY'and A = PU. Then 
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in terms ofY' and U, equation (24) becomes 


(U T Y' I + B) PY' = FU , 


and a premultiplication by P 1 gives 


(U T Y' I +D) Y' = U 


where D is a diagonal matrix whose entries are merely the eigenvalues of B. 
(Without loss of generality, we may assume the eigenvalues to be arranged in 
increasing order, sayA^ £/V 2 <A . 3 .) Multiplying each side of the last equation 
by adj (u T Y ' I +d) and then by U T yields the scalar equation 


det(u T Y' I +D) U t .Y' = U T adj (U T Y' I +D) U . 


However, U T Y' ~ A T Y-A and D is diagonal, so that this last equation may be 
written in the form 


+;v i) + K a) ( k +X s) 


u i 2 [ K + k 2 ) { k+k 3 ) +u 2 { k + k i) { k + k 3 ) +u 3 ( x + \) (^ + x a) ' 


The left-hand side of the above equation is Af(A) and the right-hand side is g(A). 
We may now easily determine the sign of g(A) when A. = -\ , -A. 2 , -A.j (assuming 

A. < A. 2 < A 3 ), and thus also the sign of h (A.) = k f (k) - g (A.) at these points. From 
these considerations, it can be shown that A 0 is at least as great as -k lt and 
also that the largest zero of g(^) (provided g (A.) is not identically zero, i.e., 

A = 0) is less than or equal to ~k x , Hence, a lower bound of our desired zero 
(A. 0 ) is -A.J, which can be determined only by solving a cubic. But the largest 
zero of g(^) is also a lower bound, and this can by found by solving a quadratic. 

Thus far, we have obtained the minimum of <P{ R) for rotations of less than 
180 degrees. To obtain the minimum among all 180 degree rotations, we use 
the method of Lagrange for solving extremum problems with a constraint. The 
necessary conditions become 

-4&i-X)w ij + ( W i -X)v ij+2 , Xj =0 j = 1,2,3 (25) 
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where ( j . is Lagrange’s multiplier and X must be such that X 2 - 1. In terms of 
our previous notation, the above conditions may be collected into the single vector 
equation 



0 , 


where Bis the symmetric matrix introduced jn equation (24). This last equation 
can be satisfied if and only if p. is a root of the cubic equation 


det 


(§ - 2 £ V.-W,) 


I + B 


= 0 


but the roots of this equation are given by 


~2 - 2 



(k = 1, 2, 3) 


where the X k are eigenvalues of B. The condition equations thus become 


(B-\ k I) X = 0 , X 2 = 1 , 


and the solutions are just the unit eigenvectors of B. The value of <£>(R) at each of 
these solutions may be obtained by multiplying the j th equation of equations (25) 
by x ; and adding the three equations together. This yields 


= 4 £ ( v i' x ) 
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which when substituted into the expression for <£(R) (equation 20b) gives 


«*> = £ ( v i 2 * »i 2 + 2V , • »i) - h 

~~ L ( V > "*i) 2 + • 


Thus the eigenvector (with unit length) corresponding to the minimum eigenvalue 
(X x ) gives the minimum of 0 (R) for all rot .tions of 180 degrees. 

The minimum of <p(R) for rotations other than 180 degrees is given by equa- 
tion (23), and is 


= £ ( v i-*i ) 2 - 2X » 


and it was also shown that \ 0 > -\ x . Hence, the rotation giving the minimum <£(R) 
among all rotations is given by 


Y - (X Q I + B) 1 A, when f (\ 0 ) / 0 , 


or by a 180-degree rotation with axis of rotation X defined by 


(X Q I + B) X = 0 and X 2 = 1 , 


when f (X 0 ) - 0, where X 0 is the largest zero of h(X). 

To avoid inverting a near-singular matrix and dealing with large values of 
the components of Y when f (X 0 ) is near zero, the Z vector representation of R may 
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be obtained from equation (17) and the relationship 


adj I + B) 
= det(\ 0 I + B) 


adj I + Bj 


This gives 


sgn“ f (X- 0 ) r 

-B L r = =5 adj(\ 0 I + B) A , when f (\ 0 ) / 0 , 

yfM] 2+ AT[adj(X 0 I + B)] 2 A L ' 'J ' 


and 


I + B) Z - 0 and Z 2 - 1 , when f - 0 , 

where 

adj(\I+B) = \ 2 I + X[Tr (B) I - B] + adj(B) , 

f(\) = /V 3 +T r (B)\ 2 + Tr (adj (B)J k + det(B) , 

g(M = A T [adj (kl + B)] A , 

and Tr(B) denotes the trace of B. These last equations are easily verified by 
direct calculations. 

If n = 2 and Vj 2 = V 2 2 = W t 2 = W 2 2 , we conjecture that the following simple 

procedure gives the least squares solution. Let 

V t x V 2 Wjx W 2 

u, = , u,' = 

I V i x V 2 | 1 | Wj x w 2 
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V 1 tV 2 , »■ + », 
V, +V,| 2 |w, +w 2 


and obtain the rotation which takes each U x into U/ , by the techniques of the 
preceding section. Such a rotation takes the plane determined by V x and V 2 into 
the plane determined by W x and W 2 and insures that W x • RV X = W 2 • RV 2 . That such 
a rotation is the least squares rotation has not been verified by direct calculation 
due to the lengthy and laborious algebra involved; however, symmetry considera- 
tions allow no different solution to be proposed. 

We have also been very successful in solving equation (21) in the cases n > 2 
by successive substitutions, i.e., using the iteration 


y 


j+i 


“X; •Y i )W 1 + (W i -Y j )V, t » 1 *V 1 ] • 

2 [ (w * * v ‘) ■ Yj + ( Vi ' Y ‘) ( w *' Yi > 1-1 


Here, Yj is the j th approximation toy, and Y 0 is obtained as above using two of 
the V. and their corresponding images W i . In fact, in one case studied, the pro- 
cedure converged even when the angle of rotation was as large as 179 degrees. 

7. RELATIONS BETWEEN EULERIAN ANGLES, MATRIX OF ROTATION 
AND VECTOR APPROACH 

Since Eulerian angles have a wide usage (especially when the angles cor- 
respond to yaw, pitch, and roll) it may be convenient or necessary to transform 
the matrix or vector parametrization of a rotation into Eulerian angles. To do 
this, a convention or positive sense of rotation must be established. Here, we 
assume that the matrices of the simple rotations about each of the coordinate 
axes are given by 



0 

0 V 

/ cos 0 

0 

- sin 0\ 

Rj (0) = o 

cos 6 

sin 6 

I . R 2 (0> = 0 

1 

° 

Vo 

- sin 6 

cos 8 j 

\ sin 6 

0 

cos d j 


/ cos 6 sin 6 0 \ 

R 3 (6) = I -sin/9 cos 6 0 J 

\0 0 1 
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where R.(0) indicates a rotation of d about the i th axis. The matrix of any rota- 
tion R can then be written as _ _ 

1 X , A , o 

j = 1, 2, 3 

R = R k p 3 ) R j ( 6 i) R i ( 9 l) k = X ’ 2 ’ 3 

i t j/ k 


If i, j, and k are distinct, then by direct calculation one finds that 

sin0 2 = S ijk r ki , 005 02 811119! - — ^ i jk r k j ’ 

cos 0 2 cos 0 X = r k k ’ cos 0 2 sin 0 3 - — ^ i j k r j j , 

COS 0 2 COS 0 3 = r ! ! , 

where S k = 1 if ijk is acyclic permutation of 123 and S ijk = -1 otherwise. 
Thus if R is given, the angles are defined as follows: 

sin0 2 = S ijk r ki , 

COS $ 2 = ±V r i 2 i + r j 2 i = ± V r k 2 k + r kT ; 


if cos 0 2 f 0 , 

sin 0j = - S ijk r kj /cos 0 2 , cos 0 X = r kk ,/cos0 2 , 

sin0 3 = " S ijk r ji/ COS0 2 ’ COs6> 3 = r ii/ COS0 2 
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if cos 6 2 - 0, 8 x and 0 3 are subject only to the conditions 


sin (« 3 ± 8 ljk «i) = S . jir .. , 

cos (»3 * S , Jk «.) = 5 8 ijk r ik , 

where the upper signs are taken if sin 0 2 = 1 and the lower signs when 
sin d 2 - -l. 

The factorization is not unique even when cos & 2 / 0 since either choice of 
sign for cos 6 2 produces the same product matrix R. 

To factor R in the form 


R = E 1 (»3) *| («,) R i («.) 

(where the first and k jt factors are of the same form), let 


sin 6 r 




. 2 
ik 


= + 


fK 


+ r 


k i 


cos 


r ix 


if sin 0 2 0, 


sin - r i ■ /sin 6 2 , 
sin d 3 - r . A /sin 0 2 , 


cos 0J - 8 . . r ik /sin d 2 , 

cos 0- ” -8.. r, ./sin 6. , 

3 j i k i / 2 


where S.. - 1 if j i is in natural cyclic order and S . . - -1 otherwise. If 
sin & 2 - o, 6 l and are only subject to the conditions 


sinp 3 ±0j) - S jirkj 
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C °s (0 3 ±6 X ) 


± r 


kk 


where the plus sign is taken if r u > 0 and the minus sign if r . . < 0. 

Given Eulerian angles 6 X , & 2 , 6 3 as defined above, the vector representation 
can be obtained by forming the product matrix R, and then using the techniques 
of the previous sections. A more direct approach, however, is to use equations 
(15) or (16) twice, where 


Y. 

l 



Z 


sin “2 


- 77 < 8 < 77 , 


are the vectors corresponding to R^#) and Ej is the coordinate axis about which 
the rotation is taken (if n < 6 < 2n then the negatives of the above expressions for 
the range - v < 6 < 0 must be used) . 

To obtain the Eulerian angles from the vector Z, we have from equation (5) 


rij = 2[z. Zj , 


i ^ i 


r.. = 1 - 2Z 2 + 2 Zi 2 


where the upper sign is used when i j is in natural cyclic order and the lower 
sign otherwise. The angles are then obtained from the appropriate formulas 
above. 

8. EQUATIONS OF MOTION OF A RIGID BODY 

If R(t) is the matrix of a rotation which defines the orientation of a coordinate 
system (attached to a moving rigid body) relative to a fixed coordinate system 
then R satisfies the matrix differential equation 


R(t) r D(t)R(t) , R(0) = I , (26) 
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where 0 is a skew-symmetric matrix such that fiv = Vx o;(t) for all vectors V, 
and ai is the angular velocity vector. Writing this equation in the form 


RR " 1 = fl , 


and letting R - R k (9^ Rj {6^ R; (6 J , we obtain 


fl = RR -1 


R k R, R, RT> R,-> R k -‘ + R k R, R,-‘ R„-> + R k R„-l 


(27) 


Although this is a matrix equation, it represents only three independent component 
equations, since each of the product matrices on the right is skew- symmetric. 
These three independent equations can be collected into a single vector equation 
by using the well-known isomorphism between 3x3 skew- symmetric matrices 
and three-dimensional vectors, 



It is easy to verify that if R is a rotation matrix, then RS(V) R -1 — >RV. Also, from 
the definitions of Rj, it is straightforward to show that!?! (9) Rf x (<9) -—9E l , 
where Ej is the coordinate axis of rotation (this is also valid for rotations about 
any fixed line). From these considerations, and the fact that fi— • ’oi , equation 
(27) is equivalent to 


- = ( 0 3 ) R i (* a ) E i + *aM* 3 ) E i + ^3 E k 


= A<9 , 9(0) = 0 , 


(29) 


where A is a matrix whose columns are just the vectors R k Rj E. , R k E j , and E k , 
respectively, and 6 - 6 2 , 
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The matrix differential equation has no singularities, but requires the inte- 
gration of nine scalar functions. Equation (29), on the other hand, only involves 
three scalar functions, but the matrix A is singular when cos 6 2 =o for k / i , 
or when sin 8 2 = Ofor k = i. Thus, no set of Eulerian angles can be chosen so 
that 8 will be defined for all rotations. In fact, any set of Eulerian angles gives 
singularities for rotations as small as ninety degrees. 

To obtain the equations of motion expressed in terms of the Y vector, we 
merely differentiate equation (13), and make the proper substitutions using equa- 
tions (4) and (13), and the matrix differential equation. This gives 

y = -j [(co ■ Y) Y +Yx co + co] , Y(0) = 0. (30) 


To solve for co cross -multiply each side by Y, and subtract the resulting equation 
from the original. Thus, 



Y-Yx Y . 


The differential equation (30) has no singularities, but from the definition of 
Y, we know that solutions involving 180° rotations will diverge to infinity. For 
many applications, however, involving only moderate displacements of the moving 
frame, this will not present any difficulties. 

The differential equations in terms of the Z rotation vector can be obtained 
in a manner similar to that used for the Y rotation vector. However, a more 
direct approach is to use the identity 


whence, 


1 



1 Y • Y 

• y 

yi+Y 2 (l + y 2 ) yi + Y 2 


(31) 


(32) 
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From equation (30) we have 


Y • Y 



(33) 


Thus, combining equations (30) thru (33), we obtain 

Z = \ [zxoi+ yi -z 2 , Z(0) = 0. (34) 


Conversely, let Z(t) be a differentiable vector function such that over some 
interval, say t 0 <t<t 1 ,Z 2 <l and Z satisfies equation (34). Then Z(t) defines a 
rotation matrix R(t) by equation (5) andR is given by 


R - -4Z • ZI + 2 (zZ T + ZZ T ) + 2 Vl - Z 2 S(Z) ~ 2Z • z(l - Z 2 )“ 1/2 S(Z) 


where S(V) denotes the skew- symmetric matrix formed from the vector V by 
equation (28). Taking the dot product of each side of equation (34) gives 


Z • Z 


2 dt 


\a> • Z/T^Z 1 , 


or 


z • z(l-z 2 ) 1/2 


1 

2 w ' 2 . 


(35) 


since Z 2 < 1. When equations (34) and (35) are substituted into the expression 
for R we find 

R = a -z 2 ^a> T + ojZ t + S (zx <u + Vl - Z 2 co) - 2co • Zl] + z(Zx a>) T + (zx cw) Z T 


- a; * ZS( Z) 
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and direct calculation will indeed verify that R = fiR for t 0 < t < t r In fact, R = UR 
even if Z 2 = 1 provided Z has a derivative at this point satisfying equation (34) and 
equation (35) is valid in the limit as Z 2 approaches unity. For example, if co is 
a non-zero constant vector, then 


M t 


s in 


is a function satisfying equation (34) for -n < |w| t < v and indeed the matrix R defined 
by Z satisfies equation (26) over the same interval. 

,, Thus, both equations (30) and (34) define the motion of a rigid body over a 
wider range of allowable orientations than Euler's equations (equation 29) and 
only require the integration of three scalar equations which do not contain trigo- 
nometric functions. Furthermore, the results of the integration (especially Z) 
can be used directly, without having to generate the matrix, e.g. the coordinates 
of a vector relative to the fixed system may be obtained direct’y from the co- 
ordinates relative to the body system and Y or Z per Section 3; if Z x defines the 
orientation of the body at time t j relative to the body system at t - 0, and Z 2 
defines the orientation when t = t 2 relative to the body system at t = tj, then 
the rotation product Z = Z 2 * Z l gives the orientation at time t 2 relative to the 
body system at t = 0 for all Zj and Z 2 . Eulerian angles can also be obtained as 
described in the previous section. 

As with most differential equations, one would have to devote considerable 
time to equations (30) and (34) in order to describe completely the properties of 
the solutions. However, we mention only one common property of both equations, 
which is useful for approximating the solutions for small increments of time. If 
Y(0) = Z(0) - 0, then we note that the n th derivative of Y or Z at t = 0 contains 
the term o (n_1 >(0) (the (n-l) th derivative of oj at t - 0), and for the first two 
derivatives this is the only rrm. Thus, 


Y(h) 


If 

Jo 


ojdt + U 


Z(h) 


if 

Jo 


ox it + V 
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I " 

where U and V are of the order of h 3 . Hence, to second order, both solutions may 
be approximated by the integral of the angular velocity. 

9. CONCLUSION 

The significant advantages of the vector approach to rotations as presented 
here over other parametrizations is that the vector parameters can be obtained 
with ease from basic data, and need not be transformed to a new set in order to 
perform the algebra of rotations (the product of rotations and the product of a 
vector by a rotation). Not only is the need for evaluating trigonometric functions 
removed, but the singularities introduced when one attempts to obtain the polar 
form of a vector or factor a rotation matrix into simple rotations are also re- 
moved. To illustrate these last remarks, we cite one final important application 
of our vector approach to rotations. 

In orbit theory, it is customary to obtain the components of the position and 
velocity vectors by rotating a coordinate system in which the direction cosines 
of the angular momentum vector are given by E 3 = (0, 0, 1) T , into a fixed system 
in which these direction cosines are also known, say H - (h p h 2 , h 3 ) T . This is 
usually done by assuming H is of the form H - (sin i sin 0, - sin i cos 0, cos i) T ; 
thus the rotation is normally given by R = R 3 (-Q) Rj(-i). Unfortunately, this 
technique produces a singularity even in the trivial case i = 0 (no rotation re- 
quired). On the other hand, from Section 4 we find almost immediately that 


Z 




H x E, 



is the rotation vector taking E 3 into H. Thus, the two parameters z x and z 2 define 
the rotation uniquely for all i except i - it. 
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